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Abstract. In order to investigate corrections to the common KdV ap- 
proximation for surface water waves in a canal, we derive modulation 
equations for the evolution of long wavelength initial data. We work 
in Lagrangian coordinates. The equations which govern corrections to 
the KdV approximation consist of linearized and inhomogeneous KdV 
^ I equations plus an inhomogeneous wave equation. These equations are 

explicitly solvable and we prove estimates showing that they do indeed 
give a significantly better approximation than the KdV equation alone. 
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pq . 1. Introduction 

> ' 

(•f^ I It is often easier to write down a partial differential equation which models 

^^ ■ a physical phenomena than it is to study solutions of such an equation. 

''^ I Equations which model the evolution of the surface of a fluid in a canal 

(<— ^ ' have been known since at least the 19th century, however it has only been 

Tij- . in recent years that questions of existence and uniqueness for general initial 

O I data have been answered (see [HH]) [SHI and [23 )• Moreover, numerical 

,J^ ■ simulations of water waves and similarly complex phenomena are frequently 

j^ , time consuming and challenging to implement. Consequently, it can be 

quite difficult to say much about the behavior of a general solution. And 
so scientists often restrict their attention to limiting cases — for instance, 
one may assume that solutions are of long wavelength and small amplitude 
/\ • (see Figure ^). Under such a supposition, a modulation equation may be 

(formally) derived. In particular, one hopes that the modulation equation: 

• is well-posed, 

• is either explicitly solvable or easy to solve numerically and 

• captures the essential behavior of the original system. 

Remarkably, many seemingly disparate physical phenomena possess modula- 
tion equations of the same form. For solutions of long wavelength, Korteweg- 
de Vries (KdV) equations are often used as modulation equations for a wide 
variety of non-linear dispersive systems, including the water wave equation, 
the Euler-Poisson equations for plasma dynamics and the Fermi-Pasta-Ulam 
equation for the interaction of particles in an infinite lattice. 

Despite the fact that modulation equations have been in use for over a 
hundred years — the KdV equation was first proposed as a model for water 
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Figure 1. The long wave, small amplitude scaling. 



waves by Boussinesq in 1872 and also by Korteweg and de Vries in 1895 — 
only recently have attempts been made to rigorously connect the behavior 
of the modulation equations to the original physical problem. In particular, 
through the work of Craig [HI, Kano and Nishida ^7j |18| . Kalyakin J16j . 
Schneider ,30;, Ben Youssef and Colin jTj and Schneider and Wayne f^ . 
PH] , the validity of KdV equations as a leading order approximation to the 
evolution of long wavelength water waves and to a number of other dispersive 
partial differential equations has been established. 

In many respects, the KdV equation is an ideal modulation equation; it is 
simple in form and explicitly solvable via the inverse scattering transform. 
Nevertheless, both experimentally and numerically one observes deviations 
from the predictions of the KdV approximation. In this paper we derive 
a hierarchy of modulation equations which govern corrections to the KdV 
model and also prove rigorously that these higher order equations do indeed 
improve the accuracy of the approximation. While the correction is valid in 
general long wavelength/small amplitude settings, heuristically the model 
is set up to better approximate interactions between solitary waves — both 
counterpropagating collisions and unidirectional interactions. 

Note that in |3S], as a case study, Wayne and Wright examined higher 
order corrections to the KdV approximation to a Boussinesq equation. As 
the KdV equation is in some sense a universal approximation for long waves, 
we expect that the equations for corrections to this approximation will also 
be universal. Indeed, our results show that the higher order corrections for 
the water wave equation are nearly identical to those for the Boussinesq 
equation. Also, since a significant part of the work for the Boussinesq prob- 
lem consists of showing that the modulation equations have well-behaved 
solutions over the time scales of interest, this is of use in tackling the water 
wave problem. 

We now describe our results in some detail. The equations of motion for 
a water wave in an infinitely long canal (commonly called the water wave 
equation) are 

xttil + Xa) + ya(l + ytt) = 
(WW) yt = K{x,y)xt 

a G M, i > 0, (x(Q,t),y(a,t)) G M^ 
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-2dTU = ^d}_U + -dfs_{U^) 
2dTV = ld}^V + ld^4V'^ 



Figure 2. Sketch of the right and left moving wave trains. 

where K{x,y) is a complicated operator (see Section El for the definition of 
K) and (a + x{a,t),y{a,t)) parameterizes the free surface. According to 
the KdV approximation results of [HJ, to the order of the approximation 
long wavelength solutions of HWW|) split up into two pieces, one a right 
moving wave train and one a left moving wave train. Each of these wave 
trains evolves according to a KdV equation, and there is no interaction 
between the left and right moving pieces. That is, for < e ^ 1, if we 
scale amplitudes to be O(e^) {i.e. small) and wavelengths to be 0(e^^) (i.e. 
long), then for times of 0(e^^), solutions to (|WW|) satisfy 

(1) -Xa{a, t) = e^U{e{a - t),eh) + eV(e(a + t), e^t) + O(e^) 

where U and V satisfy the KdV equations 

1„, 3 

-2dTU = 

(KdV) 

See Figure|21 Here, l3± = e(/3ibt) represent long wavelength moving reference 
frames and T = e^t is the very long time scale coordinate. For technical 
reasons (which we discuss later), —Xa is the natural variable to estimate for 
the water wave equation. To lowest order, — Xq, is proportional to the height 
of the wave. At higher order, this ceases to be true, though for purposes of 
intuition one can think of — Xq, as representing the wave amplitude. 

The KdV equation was initially derived from the water wave equation in 
an attempt to prove the existence of a solitary wave solution for waves in a 
canal. Famously, the KdV equation admits solitary wave solutions and also 
multi-soliton solutions. (See Figure IH]). We will frequently refer to multi- 
soliton solutions as "overtaking wave" collisions. We remind the reader that 
the only notable first order effect after such a collision is that the waves are 
phase shifted after the collision. 

Given the results in jHJ, one expects to see similar behavior in solutions 
systems modeled by KdV equations. Though it is unknown if these soliton- 
like solutions persist globally, analogous behavior is indeed observed for very 
long times (see ^1]). The most notable deviation between true solutions and 
the KdV approximation is the size of the phase shift after a collision. In 
addition, soliton-like solutions to the type of systems we study frequently 
develop a very small amplitude dispersive wave train behind each soliton. 
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before collision 



during interaction 



after collision 



* dashed lines are KdV approximation 
Figure 3. Sketch of the overtaking wave interaction. 
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Figure 4. Sketch of the dispersive wave. 



which moves in the same direction, see Figure 01 The KdV approximation 
does not predict the existence of these dispersive wave trains. As these sorts 
of discrepancies are observed even in the case where there is only one wave 
train moving unidirectionally, we believe that they are, loosely, independent 
of interactions between the left and right moving wave trains. They reflect 
intrinsic differences between the approximation and the original system. 

On the other hand, there is evidence that a noticeable interaction takes 
place between the left and right moving waves. One can see from the form of 
the approximation in equation (^ that during a head-on collision of waves 
moving in opposite directions, the KdV approximation predicts that the 
heights of the waves add linearly. In true head-on collisions in solutions to 
the water wave equation, however, the height of the waves is slightly differ- 
ent from the sum of the heights of the waves taken separately — it is slightly 
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before collision 



water wave 




during collision ^dV approximation 

Figure 5. Sketch of the head-on collision. 

larger. (See the works of Maxworthy [21], Byatt-Smith [H] [7], Cooker, Wei- 
dman and Bale 0|, and Su and Mirie [3^ 133 •) We sketch this in Figure |SJ 

Thus we might expect two types of corrections to the KdV approximation: 

• corrections due to the fact that, even in the case of a purely right 
(or left) moving wave train, solutions to the water wave equation 
are not exactly described by solutions to the KdV equation. We will 
refer to this source of error as unidirectional error. 

• corrections due the fact that the left and right moving wave trains 
will interact at higher order. We call such errors counterpropagation 
error. 

Both of these types of corrections are apparent in our results, and to incor- 
porate these two types of corrections, we add an additional three functions 
to the KdV wave trains. The first two, F and G, will correct for unidirec- 
tional errors. The third, P, will correct for counterpropagation errors. We 
scale the amplitudes of these three functions so they are O(e^), which is the 
same as the order of the error in using only the KdV equations. F and G 
will take the same functional form as U and V, as they correct for differences 
between the approximate and actual wave trains. So we add 

e^F(e(a - t), eH) + e^G(e(a + t), e^t) 

to the first order approximation Q. 

We do not expect P to be moving strictly left or right, as it corrects for 
the interaction between waves moving in opposite directions. So its spatial 
dependence will be on /? = ea. Suppose that the functions U and V are 
solitary wave solutions. In the long wavelength variables we are considering, 
this means that the right and left moving wave packets are large only over a 
length of 0(e~^). In addition, the reference frame moves with unit velocity. 
Thus we expect any interaction of the two waves to last a time of 0(e~^). 
Accordingly, we let P depend on the time variable r = et. That is we add 
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a correction term of the form 

e^P(ea, et) 

to the KdV model. 

Through formal means we find that P satisfies an inhomogeneous wave 
equation 

(IW) a^P - djP = 3dj {U{P - r, eV)y(/3 + r, eV)) . 

Similarly, F and G satisfy a pair of driven, linearized KdV equations 

1 
(LK) 



-2dTF =^dl_F + 3a^_ {UF) + r 



2dTG =^-dl^G + 2,dp^{VG) + J+. 



1 
t 

Notice that these equations are linearized about the KdV solutions U and 
V . The inhomogeneous terms J~ and J"*" are made up of a combination of 
sums and products of U , V and P. For explicit forms of these driving terms 
see the equations (|22|) . Linearized KdV equations are explicitly solvable, 
though this is a complicated matter (see [2Z| and US])- However, solutions 
are simple to compute numerically. 

One can solve inhomogeneous wave equations explicitly and easily via the 
method of characteristics. Moreover, we can reduce such systems to a pair 
of transport equations by the following fact: 



Fact 1. // dtf — dxf = l/2dxh and dtg + dxQ = —l/2dxh, then q = f + g 
Thus, we have 



satisfies dfq — d'^q = d'^h. 



(T) 
where 



P- + Pp = -\dp{U{(5 - r, e\)V{(5 + r, eV)) 
P+ - P+ = hp{U{l5 - r, e\)V{l5 + r, eV)) 



P(/3,T) = P+(/3,T) + p-(/3,r). 

We remark that the initial data for the modulation equations is deter- 
mined from initial conditions for the original system in ways described in 
Section |H1 Also, this hierarchy of higher order modulation equations is 
nearly identical to that derived in Wayne and Wright |SE1 for the Boussinesq 
equation — the chief difference lying in the specific forms of the inhomoge- 
neous terms J . 

To enforce the notion of spatial localization, we will be considering initial 
data which is of rapid decay, that is, initial data in 

H\m) = {/(a)l(l + a2)W2;(«) ^ ^s| 
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The inner product on H^[m) is given by 

(/(•),5(-))/..M = ((1 + •'r/V(-), (1 + 'r''9{-))^^ , 

where we use the standard inner product in H^. In particular, the known 
sohton solutions of the KdV equations are in such spaces. 

That the KdV equations have solutions for all times with this sort of 
initial data is well known. In particular, we have from [3J| 

Theorem 1. Let cr > 4. Then for all Ci,Tq > there exists Ci > such 
that ifU, V satisfy HKdV}) with initial conditions Uq, Vq and 

(2) max{||f/olU<^(4)nH<'+4(2)nfl'<^+9) \\Vo\\H'^{4)nH'^+^{2)nH'^+^>} < Cj 

then 

(3) 

sup {i|f/(-,T)||^<T(4)n^<T+4(2)nH<'+8, ||^(-,r)||H<'(4)nH^+4(2)n//<'+8} < Ci- 

Te[0,To] 

On the other hand it is less clear that solutions of (|IW|) and (|LK|) will 
remain bounded over the very long time scales necessary for the KdV ap- 
proximation. In ISE] we proved the following result which guarantees that 
the solutions of the modulation equations remain bounded for sufiicienly 
long times. 

Proposition 1. Fix Tq > and a > 11/2. Suppose, Uq,Vq satisfy ^ 
and U , V , P^ , F and G satisfy \KdV\) . iTT]) and \LK\) . then there exists 
a constant C2, independent of e, such that the solutions of Qj and iLK\) 
satisfy the estimates below: 

sup ||P^(-,r)||^.+3 <C2 

rg[0,Toe-2] 

sup {||F(-,r)||^an^.-4(2), ||G(-,r)||^.nH— 4(2)} < C'2- 

TG[0,To] 

Moreover, P^{P,t) = ip^{f3±,T) with 

sup \\f^{-,T)\\H.+3r,H—H2)<C2. 
TG[0,To] 

Finally we note that since drP = dpP~^ — dj3P~ we have ||9t-P||s < 

\\dpP\\s. 

With this preliminary result in hand we can state our principal results. 
Denote the sum of the modulation functions, properly scaled, as 

-^jjd {^a, t) = e^U (e {a - t) , eh) + eV (e (a + t), eh) 

(4) + e^F (e {a - t) , eh) + e^G (e {a + t) , eh) 

+ e^P(ea,et). 

As mentioned earlier, (x, y) are not the natural coordinates to study solu- 
tions to ()WW|) . The coordinates we use are {xa,y, xtY^ . Xa is approximated 
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by V'"', and the functions y and xt are approximated by functions we denote 
ipy and •(/'") respectively. They are given by 

V'^(a, t) = e^U{e{a - t), eh) + eV(e(a + t), e^t) 

+ e^F(e(a - t), e^t) + e^G(e(a + t),e^t) + e^P(ea, et) 

^^^ + ^e^^l, t/(6(a - t), eh) + ^e%^V{e{a + t), eh) 

+ e^ {U{e{a - t), eh) + V{e{a + t),eh))^ 
and 

V'"(a, t) = e'^U{e(a - t), e^t) - eV(e(a + t), e^t) 

+ e^F(e{a - t), eh) - e^G{e{a + t), eh) 

+ e^ip'{e{a - t), e^t) - e'^(/3+(e(Q + t), e^t) 

^-e%_U{e{a-t),eh)-^.up_ 



(6) 



le^dj_Uie{a - t),eh) - -e%^Vieia + t),eh) 



+ ^e^U\e{a - t), eh) - ^e^V\e{a + t), eh). 

We discuss the origin of these equations in Section ^ 
The approximation will be valid in the space 

Our main result is: 

Theorem 2. FixTq, C/ > 0, s > 4, cr > s + 7. Suppose U, V, P, F and G 
satisfy equations IKdV]) . UW]) and iLE\) and ip , ijj'^ and ip''^ are the combina- 
tions of these functions given in Oj, |3p and |^. Then there exist eg > and 
Cp > such that the following is true. If the initial conditions for liWW\) are 
of the form ((xQ,(a, 0),y(a,0),Xi(a,0)) '' = (O, e^0y(ea), e^0u(ea)) with 

max{||6j(-)||^<T(4)n^<T+4(2)n/f<^+9} < Cj 
i=y,z 

then for e G (0, eo), there is a reparanieterization of the free surface such 
that the unique solution to liWW\) satisfies 

xa{;t) \ ( ^\;t) 

yi;t) - r{;t) 
M;t) J \ r{;t) 

for t € [0, Tge^'^]. The constant Cp does not depend on e. 

Remark 1. The loss of the one half power of e in Theorem\^is caused by 
the long wave scaling and not a lack of sharpness in the estimates. 

Remark 2. It is clear that the form of the initial conditions specified in the 
hypotheses of this theorem do not agree with those found by setting t = in 
the approximation inequality (unless, of course, ip = Q). This is precisely 
why we mention the need to reparameterize the free surface. We discuss this 
at length in Section\^ 
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Less technically, this theorem states that solutions to ()WW|) . in the long 
wavelength limit, satisfy 

Xaia,t) = ip'^{a,t) + 0{e^) 

for times of 0(e~'^). This is, as expected, a marked improvement over the 
use of KdV alone. 

We note that this is not the first time that linearized KdV equations have 
been put forward as a means to improve the accuracy of the KdV approxima- 
tion. Other instances where linearized KdV equations appear include Sachs 
[28] . Sattinger, Haragus and Nicholls Jl], Kodama and Taniuti |2^ and 
Drazin ^J|. Moreover, there have been numerous models put forward over 
the years which model water waves in the same scaling regime we are consid- 
ering. We refer the reader to Kodama ^3, Olver ^26;,, Bona, Pritchard and 
Scott J2j, Craig and Groves ^01 , Dullin, Gottwald and Holm ^2], and Bona 
and Chen |2|. Much of the work done in the above papers pertains to analyz- 
ing the behavior of the model equations and not to their connection to the 
original system. A notable exception is the recent work by Bona, Lannes 
and Colin jlj wherein they prove the rigorous validity of a large number 
of Boussinesq style models. Our particular combination of linearized KdV 
equations with an inhomogeneous wave equation appears to be unique and 
is asymptotically the most accurate model for long wavelength solutions to 
the water wave equation which has currently been justified rigorously. 

The remainder of this paper is organized as follows. First, in Section |2 we 
conduct a prelimary discussion of the water wave equation. Sections |21 and 
121 contain a thorough discussion of the operator K{x,y). Then, in Section 
|l]we derive the higher order modulation equations and prove an important 
estimate. In Sections IHl we prove the validity of the approximation i. e. 
Theorem 1^ Finally, Section [7| contains the details for a number of proofs. 

Acknowledgments: The NSF generously supported this research under 
grant DMS-0103915. As this work was done principally as the author's 
dissertation, special thanks also should go to Gene Wayne, who was the au- 
thor's thesis advisor, and to the Department of Mathematics and Statistics 
at Boston University, where he attended graduate school. 

2. Preliminaries 

We begin by discussing the water wave problem in greater detail. Consider 
an infinitely long canal of unit mean depth in two-dimensions (see Figure El). 
We denote the region occupied by the fluid at time t as ri(t), and the upper 
surface as T{t). We parameterize r(i) by {x{a,t),y{a,t)), where a G R is 
the parameter, and x and y are the real-valued coordinate functions. It is 
useful to break x up as follows, 

x{a,t) = a + x{a,t). 
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r(t) = {(a + x{a, t),y{a, t)) \a G M} 




n{t) 



Figure 6. The water wave in Lagrangian coordinates. 



We consider fluids which are inviscid and incompressible and flows which are 
irrotational. Also, we assume that the pressure on the top surface is a con- 
stant, and that the acceleration due to gravity is 1. With these assumptions, 
the evolution of x and y are given by the equations 



(WW) 



Xtt{l + Xa) + ya(l + ytt) = 
yt = K{x,y)xt. 



(See 0.) 

The first of these two equations is found from Euler's equations for fluid 
motion. The operator K in the second line is a transformation which is linear 
in xt, but depends nonlinearly on {x,y). That such an operator exists and 
gives a relationship between xt and yt is discussed in Sections El and El along 
with an analysis of K. Much of the difficulty in answering questions about 
the water wave equation is related to this operator. If the surface of the water 
is perfectly flat, i.e. x = y = 0, then we have K{0,0) = Kq where Kq is a 
linear operator defined by K{)f{k) = KQ{k)f{k), with KQ{k) = — itanh(A:). 
Notice that since tanh is a bounded function, Kq is bounded from H^ to 
H^. Kq will be appearing frequently. 

This formulation of the water wave problem is said to be in Lagrangian, 
or material, coordinates. In this point of view we are not in a fixed "lab" 
frame, but instead we are tracking the position of each "particle" of water 
separately. That is, (a + x{a,t),y{a,t)) gives the location of the particle 
which was initially at (a + x{a,0),y{a,0)). The laboratory, or Eulerian, 
point of view is to fix a system of coordinates on the fluid domain and to 
measure the velocity of the fluid at each point of this fixed reference frame. 
For our purposes, it is far more convenient to use Lagrangian coordinates, 
however experimentalists work with Eulerian coordinates. We will give for- 
mulae for the approximation in terms of Eulerian coordinates in a future 
publication. The interested reader may also see the author's thesis jHZI for 
this information. 
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Since the water wave equation is second order in time for both x and y, 
one might suppose that four functions are neccessary to specify the initial 
state of the system — x(a,0), y{a,Q), xt{a,0) andyt(a,0). In fact, in general 
only three are needed, and for initial data with small amplitudes, only two. 
The relationship yt = K{x,y)xt specifies the value of yt{a,0) given the 
other three functions. We can also "do away" with the initial condition for 
X, provided we are in the small amplitude, long wave limit. If x(a,0) and 
its first derivative are sufficiently small, then a + x(a,0) will be invertible. 
This implies that r(0) will be a graph over the horizontal coordinate. And 
so, without loss of generality, we can reparameterize the initial conditions 
so that 

r(0) = {{a,y{a,0))\a G R} . 

Thus we need only to choose y(a,0) and xt{a,0). As it turns out, we need 
to reparameterize the system one more time to prove the approximation 
theorem, but we will leave this technicality until Section The essential 
point here is that due to the freedom in choosing the initial parameterization, 
we can eliminate two of our initial conditions. 

Even though we can assume that x(a, 0) = (or, alternately, is small), 
this coordinate grows linearly in time. (See the linear estimates in Chapter 
2 of j;-{lj.) As we are concerned with very long time scales, this is a problem. 
As was shown in |,Slj . one can replace x with the new coordinate z = Kqx, 
which is well-behaved over long times. Rewriting ()WW|1 with this new 
variable, we have 



(WW3) ^ ^_ 



dtz = Kqu 
dty = K{z,y)u 

d^yil + dfy) 



dtu 



1 + K^^daz 



We will see that the operator K{x, y) in truth depends not on x but on 
z, so the abuse of the notation K{z^ y) above is in some sense legitimate 
(see Section 131 for further discussion). Furthermore, even though K^ is not 
well-defined, as it blows up at frequency A; = 0, the composition 

L = K^^da 

is well-defined as its symbol, — /c/tanh(/c), has no singularities; L is also 
invertible. Finally, we notice that the Maclaurin expansion of i^o(^) = 
—ik + 0{k^). Thus to lowest order Kq ~ —da- And so z ~ —Xa- This 
is precisely the reason why, in the Introduction, we stated that —Xa is a 
natural coordinate for the water wave equation. 

Though we will primarily be working with the three-dimensional system 
IWW3| we will need to embed this system into a four-dimensional system 
to prove certain aspects of the validity of the approximation. We introduce 
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the new coordinate a = ut, and (|WW.S|) becomes 

dtz = Kqu 
dty = K{z,y)u 
(WW4) dtu = a 

^ adgu + dgdtjl + dfy) + dtydfy 
*" ~ 1 + K^^dgz 

Though things appear to be getting out of hand, we remark that this is 
as large a system as we will need. Results in [2^ prove that solutions to 
equations (|WW|) . (|WW3|) and HWW4() do indeed exist for long times. We 
will be considering solutions to the four dimensional system which are in 

The main goal of this paper is to prove Theorem |2 To do this, we first 
prove a similar theorem for solutions to HWW3|) . from which Theorem [^J will 
follow. Let 

(7) ^'^(a, t) = ^^(a, t) + e^Wsiea, et). 

The additional function W3 will solve an equation we specify later. While 
we assure the reader that we will be following Chekhov's rule and that this 
gun which appears in the first act will be fired in the third, interested parties 
may look ahead to equation ()23() in Section ^ for more information about 
W3. Define the functions 






^y =m^ + e^^i + £6/^2 



^« =d-^dt^^ 



Ai and A2 are combinations of solutions to the modulation equations and 
are given in equations ((T7|) and ((T^ in Section |1J We justify the presense of 
the inverse derivative in ^" by means of example. ^^ contains the term U 
which solves a KdV equation. Thus dt'^'^ contains the terms from the right 
hand side of the KdV equation, all of which are perfect space derivatives. 
These then are cancelled by the inverse derivative. It is simple to check that 
this method applies to all terms in \1'°'. 
With these function, we have 

Theorem 3. Fix Tq, C/ > 0, s > 4, a>s + 7. Let ^^, ^ ^ ^^ and ^" be 
as above. Moreover assume the initial conditions for iKdV\) satisfy 

max{||t/o||H<^(4)nH<^+4(2)nH<'+9> Wo\\H'^{4)nH''+^(2)nH'^+^} < Ci- 
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Then there exist eo > and Cp = Cf(Tq, Ci,s) > such that if the initial 
conditions for I WW3^) are of the form 

z(a,0) \ / ^^(a,0) \ 

u(a,0) / \ ^"(a,0) / 
with ||fio||ft= ^ Ci then the unique solution of \WW3\) satisfies the estimate 

Zi;t) \ I ^^(.t) \ 

y(.,t) - ^y{;t) <Ci.eiV2 

n(.,t) / V ^>-{;t) j ^^ 

for t € [0, Toe^"^]. The constant Cp does not depend on e. 

3. The operator K{x,y) Part I: Basics and basic expansions. 

This is the first of two sections where we discuss the operator K(x,y) 
which gives the relation between yt and xt in the water wave equation. Here 
we briefly discuss the origin of this operator and report expansions of K 
found in previous work. We also state some very basic facts about these 
expansions. In Section El we quote more complicated results and prove some 
new technical extensions needed for our purposes. 

K{x,y) (or, more precisely —K{x,y)) is sometimes called the Hilbert 
transform for the region ^{t). Loosely, given a region Q. in the complex 
plane, and any function F which is analytic in fi, the Hilbert transform for 
i7, H{0,), is a linear operator which relates the real and imaginary parts of 
F on the boundary of Q. That is 

lm{F)\dn = H{Q)Re{F)\9n. 

For example if were the lower half-plane, then the Hilbert transform would 
be the operator H, given by Hf = i sgn{k)f. (This particular operator H 
is also frequently called the Hilbert transform.) The nature of the operator 
depends greatly on the region begin studied. Unsurprisingly, the proof that 
such an operator exists is connected to the Riemann mapping theorem and 
to techniques for solving boundary value problems for Laplace's equation 
in the plane. In this problem, since the region r2(t) is completely specified 
by the coordinate functions x and y, we denote the Hilbert transform by 
H{n{t)) = -K{x,y). 

As we are considering a fluid which is incompressible and a flow which is 
irrotational, xt{a, t) — iyt{a, t) is the value on the upper boundary T{t), of an 
analytic function on J7(t), uj{a + if3) = v^{a,P) — ivy{a, (3). Here (v^,f^) is 
the velocity held for the fluid in the whole region. Thus, given that K{x, y) 
exists, we have 

yt = K{x,y)xt. 
Of course, the boundary of Q(t) is not just T{t), but also includes the bottom 
of the canal {i.e. where (3 = —1). As we do not have fluid flow through the 
bottom, we have i;^(a, —1) = 0. 
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Under these conditions, K{x, y) has been analyzed extensively by Craig in 
[HI and Schneider and Wayne in [HJ. In particular Craig shows that K(x, y) 
has the following expansion: 

(8) K{x,y)u = Kqu + Ki{x,y)u + S2{x,y)u 

where 

KQu{k) = —itanh{k)u{k), 

Ki{x,y)u = [x,Ko]daU- {y + Ko{yKo))daU 

and ^2 is quadratic in (x,y). 

First of all we note that Kq is a bounded operator from H'^ to H^ since 
tanh(fe) is a bounded function. That is 

||-?^0«||s < ll^^lls- 

The operator L = Kq da, which is well-defined as we discussed in the 
Section 12 will also be used frequently. L is not a bounded operator on H^. 
It effectively takes one derivative. That is 

(9) ll-^^lls < lkl|s+i- 

On the other hand, L^^ replaces one derivative. That is, since 

\tanh{k)/k\ < (1 + A;2)-V2 
we know 

(10) \\L~^u\\s < Ms-i- 

We will be considering functions which are of long wavelength. That is, 
functions of the form /(a) = F{p) where /? = ea. We define operators -fCo.e 
and Lg via 

K^,,F{(5) = Kofia) 

L,F{P) = Lf{a). 

Taking the Maclaurin series expansion for tanh(/c) shows that formally 
Ko,e = -edp - y^dl - ^e'dl + 0{e'). 
Similarly L^ and L^^ have expansions in terms of derivatives 



We call such expansions of Fourier multiplier operators "long wave approxi- 
mations" . The rigorous connection between a long wave approximation and 
the original operator is given in the following Lemma, whose simple proof is 
contained in Section [7| 
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Lemma 1. Suppose A and An are linear operators defined by Af{k) = 
A{k)f{k), and Anf{k) = An{k)f{k) where A{k) and An{k) are complex 
valued functions. Also suppose that \A(k) — An{k)\ < C|A;|". (e.g. An is a 
Taylor polynomial for A.) Then for f £ jj^+it- yj^ have 

\\Af{.)-Anf{-)\\s<CWJ{-)\\s. 

Moreover, if f(a) is of long wavelength form — that is if f{a) = F{ea), with 
F £ JJS+1T- — then for < e < 1 there exists C independent of e such that 

\\Af{-)-Anf{-)\\s<Ce--'''\\F{.)\l+n. 

In j31j . Schneider and Wayne show that the operator K{x,y) does not 
depend on x per se, but rather on z = Kqx. We confuse the notation for the 
operators intentionahy. They show the K{z, y) has the fohowing expansion 

K{z,y)u = Kqu + Ki(z,y)u + S2{z,y)u. 



where 

with 



Ki{z,y)u = Mi{z)daU - (y + KoyKo)daU 

mh{z)v]{k) = - f ^'^I'Y^ zik - l)v{l)dl. 
J Ko[k — I) 

S2 is an operator which depends quadratically on z and y. Section [S] contains 
an analysis of these operators. 

By using the hyperbohc trigonometric identity 
(11) tanh(?) - tanh(fe) ^ ^ _ ^^^^^^^ ^^^^^^^ 

tanh(/ — k) 
we can simpHfy the expression for Ki to 

Ki{z,y)u=Mi{z + y)daU 

= -{z + y + Ko{z + y)Ko)daU 
Since we know Kq is a bounded operator, it is clear that 
(13) \\Mi{z)v\\s < \\zv\\s. 

4. The derivation 

In this section we will derive the higher order correction to the KdV 
approximation. For technical reasons, it is most convenient to work with 
the water wave equation written in the form HWW3|1 . Suppose that one 
is given the function ^(a,t) = (^''(a, t), ^^(a, t), ^"(a,i))*''. The amount 
that this function fails to satisfy HWW3|) is called the residual and is given 
by Res[^] = (Res^,ReSy,Res„)*^' with 

Res^ = dt"^' - i^o^" 

Resy = dt^y - K{^^,^y)^'' 

1 + Q^y^y 

Res„ = 9i^" + dc^^y * . 

1 + L*^ 
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For a true solution, notice that Res[^] is identically zero. 

Remark 3. We will also consider the four dimensional system iWW4[ )- If 
we let \I'" = dt^"^ , then we have the additional Res function 



Resa =9t*" + 






=dtResu + T-^—Resu- 

1 + L^^ 

The main goal when deriving modulation equations is to choose a system 
of equations such that solutions to this system yield a very small residual. 
This is different than (but connected to) showing that solutions to the mod- 
ulation equations are close to true solutions for the original problem. This 
latter issue is precisely that answered by the main results, Theorem |2l and 
Proposition |51 and is discussed in Section |SJ Here, we will perform a series 
of calculations on the residual and derive equations HKdV|) , (|IW|) and (|LK|) . 
In this process we guarantee the smallness of the residual. While several 
of the steps will initially seem to have little mathematical justification (i.e. 
they are formal), once the calculation is completed it will be obvious that 
all steps are valid. For example, we will take 

(14) *" = K^^dt^^ 

With this choice 

Res^ = 0, 

which is small indeed! However, Kq is not in general a well-defined opera- 
tor. Nonetheless, when we eventually select ^^, K^ dt'^^ will make perfect 
sense. 

We are looking for solutions which are small in amplitude and long in 
wavelength. So we let 

$^(a,i) =e^Z{l5,T) 

=e2Zi(/3,r) + e^Z2(/3,r) +e^Z3(/3,r). 

Recall (3 = ea and r = et. We require Res[^] to be 0(e^^/^). Loosely, we 
need three powers of e more than the expected error of 0(e^^' ^) to account 
for the long times (0(e~^)) over which our approximation will be a valid. 
See Schneider and Wayne ^] and Wayne and Wright [SB] . 

Remark 4. More specifically, if we wish to prove Theorem [^ in the space 
9)^ we will need 

\\Res,\\s <C7e^^/2 

WReSyWs <Ce^^/2 
\\ReSu\\s-i <Ce'^/' 
\\ReSa\\s-i <Ce''/' 
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forO <t< Toe^'^. Given the definition of Resa, the final estimate will follow 

automatically from the estimate on Resu- 
me, have already chosen \I'" in terms of ^^. We wih use first use the 

expression for Resj^ to similarly determine ^^ in terms of ^^. This is not 

as simple a matter because while Kq commutes with dt, the full operator 

K{z,y) does not. 
We have 

= - dt^y + dt^^ + Mi(^^ + ^y)KQ^dadt^' + 52(*^ ^^)^". 

Notice that in the above expression we can cancel the linear terms by taking 
^y ~ $^. More precisely, we set 

^y{a, t) = e'Z{f3, t) + e^^i{l3, r) + e^Aal/?, r), 

for as yet undetermined functions Aj. So 

Resy = - e^dr^i - e'^dr^2 + Mi{2e^Z)L,e^drZ 

+ Mi(e^Ai + e^^2)L,e'^drZ + 52(^^ ^")*" 
= - e^dr^i - e'^dr^2 - 2e^ZL,drZ - e^Ko,e {2ZdpdrZ) 
- e'^AiL.drZ - e^i^o,. (AidfsdrZ) + e^Mi{A2)L,drZ 

A number of the terms in Res^^ are already 0(e^^'^). By Lemmas we 
have the following estimate on Kq ^i 



(15) \\Ko,eF\\s < e'^^F\ 



s+l- 



Thus terms containing i^o.e can be considered to be a power of e smaller 
than they appear (though this costs a derivative). On the other hand, Kq 
is bounded so we have 

(16) ||Ko,.i^||. <e''/'||i^lU. 

Thus we can use i^o,e either as a bounded functional or to gain powers of 
e, but not both. Notice that L^ does not contribute any additional powers 
of e in any case. We separate out all the terms that are already sufficiently 
small into error terms. That is 



Resy = - e^S^Ai - e^^^As - 2e^ZiL,9^Zi 

- 2e'^ZiUdrZ2 - 2e^Z2L^drZi 

- e^Ko,e {2Zidpd^Zi) - e'AiL.drZi 

+ ^small + ^S2 



18 J. DOUGLAS WRIGHT 

with 
Elmaii =2e^ZsL,drZ + 2e9(Zi + e^Z2)L,drZs + 2e^Z2LAZ2 

- e«iCo,e ((2^2 + 2e^Zs)dfsdrZ) - e^i^o.e {2Zid^dr{Z2 + e^Zs)) 

- e^AiL,driZ2 + e^Zs) - e^Ko,e (AidpdrZ) + e^Mi{A2)L,drZ 

It is clear that S^^^,^ is 0(e^^/^). That is 

ll-^sma//lls - "-^^ 

The constant C depends on various norms of the functions Zj, drZi and 
Aj. Specificahy, chasing through the various terms in E^^^^^n and applying 
the estimates in @, (fT^ . (jlSf) and ((TB|) . one can show that C depends on 

ll'2'ills+i) ||-^2||s+i, ll^slls, ||(?T-Z'i||s+2, ||9tZ2||s+2, II<9t-Z3 ||s+i, ||Ai||s+i and 

IIA2IU. 

The term Eg is also 0(e^'^'^) though this is not as obvious. We prove 
this in Proposition IHl in Section El The proof of this relies strongly on the 
fact that we have taken ^^ and ^^ such that ^^ — ^^ is O(e^). This causes 
a cancellation in S2, which in turn makes this term small. 

We now expand L^ and -fCo,€ in the remaining low order terms in Res^ to 
find 

Resy = - e^drAi - e^5^A2 + 2e^ZidrZi + ^e'^ZidrdjZi 

+ 2e'^ZidrZ2 + 2e'^Z2drZi 
+ 2e'^d(}Zidf3drZi + e^ AidrZi 



with 



+ ^ small + ^$2 ~*~ /wa 



1 



^La = - ^^^Zi{L, + 1 - -e^d})drZi - 2^Zi{L, + 1)9,^2 

- 2e^Z2(L, + l)drZi - 2e6(i^o,. + edp){ZidpdrZi) 

-e^Ai{L, + l)drZi. 

Each term in Ef^^ is ©(e"*^"^'^) by Lemmas That is 

where C depends on ||Zi|ls+3,||Z2||s, ||(9t--Z'i||s+4,Pt-Z2||s+2 and ||Ai|[s. (The 
subscript "Iwa" stands for "long wave approximation".) 
The only 0{c') terms remaining in ReSy are 

-e'drAi + 2t'ZidrZi 

which we remove by selecting 

(17) Ai = Zl 
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So 

Resy = - e^a^As + -e'^ Zidrd}Zi 

+ 2e'^ZidrZ2 + 2e'^Z2drZi 
+ 2€^ dpZidpdrZi + e'^ZldrZi 

' small ' b2 Iwa 

The remaining O(e^) terms in ReSy are all perfect time derivatives with 
the exception of 



4 
Notice, however, that 



^ZidrdlZi. 



dr {^ZrdlZ^ - \{drZrf 

=^ZidrdlZi + ^drZ2 {dlzi - dlzi) . 

Given the form of the approximation in equation (^ , it is not unreasonable 
to suspect that 

(18) djZi - dlZi ~ 0(e2). 

We are now in a position to select A2. Taking 



(19) 

gives 
where 



A2=(5^^i)' + 2ZiZ2 + ^Zf 

+ ^Z^dlZ^-'^{drZif 



ReSy — E^^^ii + Eg^ + Ei^^ + -E^^jjc/i 



ELucK = \^''drZ,{dlZ,-dlZ,] 



Given that our assumption (|18j) is valid, we have shown that with our choices 
for '^y and ^^ in terms of ^^ that ReSy = 0(e^'^/^). More specifically, we 
have shown that if 

\\dlz^-dlz^\\s<Ce''/\ 

then 

ReSy < C7e^^/2 
where the constant depends only on ||Zi|[s+3, ||Z2|ls+i, H^sHs, |1i9t-Z'i||s+4, 

||9t^2||s+2, ||<9r^3||s+l- 

Now that we have computed ^^ and ^" in terms of ^^, we now turn our 
attention to determining ^^ by examining Res^j. 



* " 1 + L^^ 



Res„ = K^^d'^-^^ + da^-^y- 
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We expand (1 + L'l'^)"^ by the geometric series to find 
where 

1 -I- f)^^y 

Since ^^ is "small" this error term can be shown to be 0(e^'^'^). We have 

Lemma 2. Let f G H^+^, s > 0. Take eq such that e§||/||Loc < 1/2. Then 
for < e < €q we have that the function 

9i^-) = TT^fi^) - 1 + ^'/(^^) 

satisfies \\g{E-)\\s ^ Ce'^'^ for C independent of e. 

Remark 5. Under the same hypotheses as in Lemma\^ arguments similar 
to the proof of that Lemma show: 

• (1 + e^/(ex))~^ G C* and is hounded there independent of e, 

• (1 + e^/(ex))""'^ — 1 E H^ and has norm there hounded hy Ce^'"^ for 
C independent of e and 

• (1 + e^/(ex))~^ — 1 + e^/(ex) — e^/^(ex) G H^ and has norm there 
hounded hy Ce^^'"^ for C independent of e. 

Now, after substituting in from the definitions of ^^ and ^^, we collect 
all the terms which are smaller than 0{e^'^''^) and find 

Res„ =e^K-ldUZi + e^Za + e^Zg) + e^a^Zi - e^S^ZiL.Zi 

+ €^d(3 {Z2 + Zl) + e''d0Zi {djZi - L,Z2 + (L.Zif) 

-e'^dp{Z2 + Zl)L,Zi + e'^dpZz 



+ e'^dp ({dpZif + 2Z1Z2 + ^Zf\ 
+ e'^^p(^p,^}Z^-'^{^rZ,f'^ 



' geo ' -^ small ' -'--'switch' 

We omit the exact expression for -E"^^;; because it is both lengthy and 
uninteresting. We have 

(20) \\E-^,ii\U<Ce''/' 

where the constant C depends on H'^^^ norms of the functions Zi and the 
H'^ norms of d'^Zi. We have also replaced one instance of d'^Zi with d'iZi 
(much as we did earlier), thus the term 

E switch = e 9/3^1 (9^Zi - 9^Zi) . 

We now define new functions Wi by Zi = Lj^Wi. This seemingly mys- 
terious (and sudden!) change of variables will seem less so if we remind 
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the reader that at the end of the day we wish model not z but rather the 
function Xa- Accordingly, if we approximate x^ by a function 

then it is logical to take 

and in the long wavelength limit we arrive at these functions Wi. 
So we have 

Res„ =€^d^^dl{Wi + e^W2 + e^W^) + e^Ko.e^i - e^WiK^^Mi 

+ e^Ko,,W2 + e'^dpiL-^Wif 

+ e^Ko^Mi {€^KQ^,dpWi - e^W2 + e^Wl) 

- (e6ifo,eH^2 + e''dp{L-^Wif)Wi + e^i^ce^^s 

~r geo ' ^ small ' switch' 

At this time, the presence of inverse /3 derivatives may seem problematic. 
Notice that each such precedes a time derivative. Once we select the func- 
tions Wi we will see that there can be an exchange between time and space 
derivatives, which will justify the instances of d^ . 

Now we replace Kq ^ and L~^ by their long wave approximates and find 



Res„ =e''d/d^{Wi + e'Ws + e^Wg 
2..._ , ^'01 + ^^^'-'^ 



e\ed^ + -e^d} + -e'd'p)W^ 



- ^\^dp + 3e^5|)M^2 + e''dp{Wif + -e^ dp{WidlWi) 

- e'dpWi {-dlWi -W2 + Wl) 

- e''Wi{-dpW2 + dp{Wif) - e^dfiW^ 

+ e^dfi ({dfiWxf + 2WxW2 - ]^l 
+ e'dp [^^dlWx - \idrWxf\ 

' ''--'geo ' ^ small ' -'-^switch ' ^Iwa' 

The error made by the long wave approximations is denoted Ef^^. By 



Lemma n we have 



I^Lall. < Ce'-^'^ 
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where C depends on ||H^i||s+7, ||^2||s+5, ||W^3lls+3, ||5rVFi|[s+2- 
Now we organize the above as 

+ e'd^^dlWi - e>dpW2 






(21) - ^e'^d^pWi + e'^WidjWi 



'3. 



+ e''2df3{Wid}Wi) + e'^ -df3{d(3Wif 
+ e'd^^d^Ws - e'dpWs 

' ^geo ' ^ small ' ^switch ' ^Iwa' 

The term on the first hne of right hand side looks formally like an inverse 
derivative of a wave equation: 

d^^dlWi - dpWi = dp^ {dlwi - ajWi) . 

We cancel this term (to lowest order) by taking Wi of the form 

Wi{(5, t) = -U{(5 - T, eV) - V{f3 + r, eV). 

Recall I3± = P ± t and T = e^r. The "minus" signs may seem arbitrary, 
but are included at this stage so that they agree with previous work in the 
area. Noting that the third line looks very much like the first, we also set 

W2{P, r) = -F(/3 - r, eV) - G(/? + r, eV) - P(/?, r). 

These choices for Wi and W2 are precisely those described heuristically in 
the Introduction. 

The first three lines in l\'21\i become 

5 / on rr , ^ ^3 tt , ^a. tt2 



e'[2dTU + -d$_U + -d^_U 



W hdxF - 2dTG - dp^ [d^U + dlV)\ 
-e^a-^ (5|F + dlG) . 
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.5 



We cancel everything multiplied by e by taking 



-2dTU = ^d^u + -d0_u^ 



2dTV = ^d}^V + -dp^V' 
dlP - djP = 3d}{UV) 

which are precisely equations HKdV|) and ()IW|) . By Proposition ^ we know 
the solutions to these equations are well-behaved over the long time scales. 
Given that the functions U and V have been chosen to solve ()KdV|) . one 
computes that 



Thus the term 9^ (^d^U + d^V) is perfectly well-defined. For brevity, we 
will continue to write these terms with the inverse derivatives instead of in 
the longer form above. 

Moreover, now we can put more precise estimates on E^^^^^f^ and -E^^j^ch- 
In particular, since each time derivative for solutions to KdV equations count 
for three space derivatives, we have 

\\djWi - d^WiWs <Ce'l'^ 

where C depends on ||PFi||s+6. 

Recall from Fact^and Propositionnthat solutions to H1W|) can be rewrit- 
ten as 

P(/?,r)=P+(/3,r)+p-(/3,T) 

=V9+(/?+,r)+(^-(/?_,r). 

The functions ip^ are rapidly decaying. We make this decomposition so that 
every remaining term in (|21|) : 

• will be a unidirectional term which is rapidly decaying; 

• will be a product of two such terms which are moving in opposite 
directions; 

• or will include a derivative of W^- 
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That is 



where 



and 



Resj, =e^ (2dTF + ^d} F + 39^_ (UF) + J" 

+ e' (-2dTG + ^d^^G + Sdp^VG) + J A 

- e'd^' {d^F + d^G) 

-I- 77"^ -I- P" -I- 77"^ -I- P" -I- P" 

"•" geo ' small ' switch ' Iwa ' time ■ 



J- = 3df3_{Uip-) + 4?72a^_;7 + ^t/a;^_t/ 
(22) + Y^/^+^^i^ + T^^k^ + ^^i^^ 



5^+4^ 



+ dp (v{3F + 3^- + 4C/2 + ldl_ UU 

+ 4df3{dp_udp^v). 



~ e'^^dp (^{drU + drvy 

Notice that J^ = J^(/3±,T). -E^^g (so called because each term in it 
contains some sort of time derivative) is clearly 0(e^^'^). That is 

II F"" II ^ r<^i'i'/2 

The constant above depends on ||C/||s+4 and ||y|[5+4. 

The term e^d^^ {^tF + d^G) is not included in E'^^e fo"^ the following 
reason. In a moment, when we select the equations F and G solve, a conse- 
quence will be that there will be terms in d^F and d^G which are 0(e~^). 
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By taking 

-2dTF = ^dp_ (UF) + ld}_F + J' 

we cancel nearly all the terms which are not in the various E^ terms. These 
are the linearized KdV equations ()LK|) discussed in the Introduction. Propo- 
sition ^ guarantees that the solutions are well-behaved. We are left with 



Res^ = + e^ (d^^ {d^^Ws - djWs) + r 
- e^a^i (4f + d^G) 

' ^geo ~r ^ small ' ^switch ' ^Iwa ' ^tivae' 

Now we consider the terms in d'n {d^F + d^G). Notice that 
-d^^d'^F =a^i5T(^a|F + hpiUF) + i J-) 



J^ contains the term 3df3_{U(p^) + -^d^ (p^ . From the definition of (p~ we 
know 



So we have 



drip- = -e-^^dpiUV). 



3 __. 3.,„ _ , 1 2 
3 „ .. 9 . „ .,,_. 1 



-ip drU + -Udrif + -dj_dTf~ 

-2a3 



We treat d^G in the same fashion. So we can write 

-e'd-p^ {dlF + dlG) = Ela - e' (^(f/ + V)dpiUV) + ^a|([/y)) . 
By construction EpQ satisfies the estimate 

\\E^F,Gh<Ge''/' 

where G depends on ||C/||s+7, ||^||s+7, ll-^lls+5, and ||G||s+5- 
We have 



Resj, = + e^ (d^^ {d^Ws - djws) + r 

"T ^qeo "T ^ small "^ ^switch "^ ^Iwa "^ ^time "^ -^F,^- 
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By selecting 

(23) d',W3 - djWs = dp {^-{U + V)dp{UV) + \dliUV) - A 

the gun goes off and we cancel all remaining O(e^) terms. Thus 

ReSy = -Egeo + Egj^^ii + -Eswitc/i + ^Iwa + ^time + ^F,G- 

Each of the S" is 0{e^'^/^). 

Unlike the previous equations ()KdV|) . H1W|) and (|LK|) . Proposition ^ does 
not tell us that the solutions to (|23jl are controllable. Nonetheless, equation 
(|23|) is an inhomogeneous wave equation where the inhomogeneity consists 
entirely of terms which are products of left and right moving rapidly decaying 
functions. From Wayne and Wright j3bj we have the following Lemma: 

Lemma 3. Suppose 

dru ± dpu = 1{I3 + r, eV)r(/3 - r, eV), n(X, 0) = 0. 

with \\l{-^T)\\^s(^2) ^ C' and \\r[--,T)\\^s^2) < C for T G [0, Tq\, then 

\\u{(3,t)\\s<C 

for all T G [0,Toe^^]. The constant C is uniform in e. 

Thus W3 will remain 0(1). 

Remark 6. If we are in the situation in which Proposition^\ applies, we see 
that least regular part in the driving term are d'i{UG) and d'i{VF), which 
are in H^~^{2). Thus, by this Lemma we have that W3 E H'^~^ for all times 
of interest. 

At this time we have derived the modulation equations and shown the 
residual is small. The only remaining order of business in this section is to 
determine how smooth the solutions to our modulation equations need to be 
in order for Res[\I'] to be appropriately regular. This may seem to be a fairly 
tiresome task, but fortunately the least regular terms in all of the sundry E 
functions come from only one term — E^^^^ ! This is because -E"^^;; contains 
many time derivatives. 

We need to control Resa in H^~^. For this we need dt^esu € H^^^, which 
in turn implies that we must have dtE^^^n £ H^^^. Recalling equation 
((201)) ^^ see that this will require d^Z2 € H^~^, or rather (since L~^ saves a 
derivative) d^W2 G H^~^. For this, we need dj^F and dj-G in H^~^. Given 
that F solves ()LK|) where J~^ contains the terms 9l U, 9? if~, one sees 
that dj^F will include the terms d%F, dl\U and d^_(p^ . So ||9|.F||s_2 is 
controlled by the H^~^^ norms of U and V , and the H^^'^ norms of c/5~, F 
and G. The analogous result is true for dj.G. We also need d^W^ G H^~^. 
Since W^ solves ((28]). we require VF3 G H'^'^^. 

In summary we have the following Proposition. 
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Proposition 2. Take ^'^ as in ^, with U, V, F, G, P and W^, solving 
their respective equations. Let 

mv =^^ + e%i + £6^2 

with Ai and A2 as in \11^ and iiy\) . and form Res[^] as in ^. Then 

\\Res,\\s <Cei^/2 
WResJs <Cei^/2 

M y w^ — 

\\ReSu\\s-i <Cei^/2 

\\ReSa\\s-i <Cei9/2_ 

where C is a constant which depends on \\U\\s-\-g, \\V\\s-\-g, ||P||s+7, ||i^||s+7j 
||G||s+7 and ||1^3||s+i. The estimate ^ holds as long as these quantities 
remain bounded. The constant C does not depend on e. 

In light of Proposition n and Remark |SJ we see that if in we take the initial 
conditions for U and V to satisfy ^ with a > s + 7, that ||C/||s+9, ||l^||s+9, 
||P||^+7, ||F||^+7, ||G||,+7 and ||W3||,+i are all 0(1) for t G [0,Toe-^]. And 
so we move on. 

5. The operator K{x,y) Part II: Estimates and Extensions 

In this section we will describe a few more estimates related to K{x,y). 
All such estimates are either smoothing estimates or ones which show that 
certain terms are small in the long wavelength setting. 

First, since 1 + Kq (k) goes to zero exponentially fast as \k\ — > 00, the 
operator 1 + Kq is smoothing. That is, for all s > 

\\{1 + kI)u\\s<C\\u\\l2. 

Also, commutators involving Kq are smoothing. We quote the following 
Lemma from 31 . 

Lemma 4. Let r > 0, q > 1/2, and < p < q. Then there exists a C > 
such that 

\\[f,Ko]g\\r<C\\f\\r+p\\g\U-p. 

Proof. See Lemma 3.12 on p 1498 of |Sl]. D 

Schneider and Wayne show that Ki{z,y) is a smoothing operator. 
Lemma 5. For r >0, q > 1/2 and < p < q, there is C such that 

(24) \\Kl{z, y)u\\r < C{\\z\\r+p + ||y||r+p) Ihllg-p- 

Proof. See Corollary 3.13 on p 1499 of |ST]. D 
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If we let S\{z,y) = K{z,y) — Kq, we also have the following estimates 
from 1311: 



Lemma 6. Fix s > 4. // the free surface is sufficiently smooth, then for 
j = 1,2 we have: 

\\Sj{z, y)u\\s < C {\\z\\i + ||y||^) ||m||3, 
that is, Sj is a smoothing operator, 

\\d^{S,{z,y)u)\U<C{\\z\\i + \\y\\^^\\uh, 
that is, daSj is a smoothing operator, 

\\[dt,Sj]u\\<C{\\z\\i + \\y\\i)\\uh, 

that is, [dt,Sj] is a smoothing operator and this operator can be 
bounded independently of dfU and 

\\[dlSj]u\\<C{\\z\\i + \\y\\i)i\\u\U + \\dtu\U), 

that is, [df,Sj] is a smoothing operator and this operator can be 
bounded independently of d^u. 

Proof. In |35, see Lemmas 3.14, 3.15 and Corollary 3.16 on pp 1500, 1506 
and 1507 respectively. D 

We will also need the following propositions concerning the behavior of 
the remainder terms Si and 5*2 . The first of these says that more or less the 
remainder 5*2 is negligible for the sort of scalings we are considering. That 
is to say, the term £'| in the Section |3 is very small. 

Proposition 3. Fix s > 5/2. Suppose z = e^Z{ea), y = e'^Y{ea) and 
f = e^F{ea), with Z,Y,F G H''^^{2). Moreover, assume z — y = e^A(ea) 
with A E H^~^^{2). Then there exist €q such that for e € [0, eo] there is a 
constant C independent of e such that: 

\\S2{z,y)f\\s<Ce^y\ 

The second is a technical version of the mean value theorem as applied 
to the operator ^i. 

Proposition 4. Suppose z{a,t) = e^Z(e(a ± t), e^t), y{a,t) = e'^Y{e{a± 
t),€H), u{a,t) = e^U{e{a±t),€^t) and f{a,t) = e^ F {e{a ± t) , eH) with 
Z,Y,U,F G H'{2) fort G [0,Toe-'^]. Also suppose W{a,t), Ry{a,t) and 
R'^{a,t) G H^ for the same time interval. Then 

\\S,{zi-) + e''/'R^{-),y{-) + e''/^Ry{-))fi-)-S^iz{-),y{.))f{-)\U<Ce'y'' 

for t e [0,To€-% 
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Proof, for Proposition El 

First, notice x{a) = L Lz{a)da = eX{ea). We know that X is in L°° by 
the fohowing Lemma. 

Lemma 7. Suppose f{a) = e^F{ea) with F G H'^{2). Then for all a 

pa 

I / fia)da\ < C€\\F\\hs(^2) 
Jo 

Proof. See Section d D 

Let ^{x,y) = (<I>i,<I>2) be the analytic map which takes r2(t) to 

^" = {(e,7)l7e[-i,o]}. 

That such a map exists and is analytic is guaranteed by the Riemann map- 
ping theorem. Let 

h{a) = ^i{x{a),y{a)) 
and Qf = f o h. From [SJ, we know that 

(25) K{x,y)f{a)=QoK^oQ-^f{a). 

We can derive a very useful implicit formula for h^^ as follows. The 
function $"^(^,7) is analytic on P~ , thus it satisfies the Cauchy- Riemann 
equations. If we set 

and notice that <1>~^ sends the bottom and top of P~ to the bottom and top 
of Q.{t) respectively, we see that we have the following system 

d^ui + djVi = 0, 

d^ui — d^vi = 0, 

vi{C,-l)=0, vi{tO)=r](C) 

where r/(^) = y(/i~^(,^)). One can solve this system with the use of Fourier 
transforms relatively simply. If one does so, one finds that 

ui(e,o) = - / 77(6^1 - M7?(e) 

Jo 
where M is the pseudo-differential operator given by 

-r~~/,x /ccosh(/c) — sinhfA:)^,, , 
iksinh[k) 

Notice that to lowest order, M is Cda- 

Now, notice that h~^{^) = x~^{u(S,,0)) and so we have an implicit equa- 
tion for h~^. 

h-\0 =x-' U- I v{Ci)d^i - Mr^iO 

(26) ) -^^^ 



-X 



' (^ " i ^(^"'^^i))'^^i - ^^(^ ° ^"')(^) 
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Remark 7. In Schneider and Wayne PTl, on p 1494, they make a minor 
error in calculating this same function. As a result, they claim the above 
representation gives an explicit formula for h^^ . Our correction here changes 
nothing about subsequent steps in their proofs. 

Since, x = a + eX{ea) where X is well-behaved, we can expect a similar 
form for x~^. 

Lemma 8. Suppose f{a) = a + g{a) with \\g\\c2 < 1/2. Then 

rHO =C- 9(0 +9(09' iO + E 
where E = 0(||g||^2,oo)- More specifically 

E <C (llfi-'llioo IbllL-" + llffllioo I|5"I|l-') ■ 
In particular, notice that if g{a) = eG(ea) this means E = O(e^). 
Proof. See Section |7| D 

We apply this Lemma to x, and find that 

x-\£.) = e - eX{eO + e^'X^eOdpX^eO + 0{e'). 

Combining this with (|26|) we can determine h~^{^) (and therefore h) in 
terms of x and y to any order we wish. To lowest order we see that 

(27) h-\0=C + O{e) 

So now we have 

h~\0=C- I y{h-\Ci))dCi-M{yoh-^){0 
Jo 

-ex(^e(^C- J^ y{h-\Ci))dCi - M{y o /i-i)(o)) 

+ 0{e') 

=e- / e^Y{eh-^{i{))dii-M{e^Yoeh-^){i) 
Jo 

-ex(^e(^C- J^ e^Y{eh"\Ci))dCi - M{e^Y o eh'^m^ ) 
+ 0(e3). 
If we insert (|27|) into the above and expand we have 

h-\0=i-eX(e£,)- I eMe6)Ki-e'M(y(6-))(e) + 0(e'). 

Jo 

One can continue in this manner and determine the next order terms in 
the expansion oi h~^. If we let 

eGi{ea) = -eX{ea) - / e^Y{eai))dai - e^M{Y{e-)){a). 

Jo 
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the expansion is 

h-HO = C + eGi(eO + e^B^) + O(e') 

where 

f< 
Bi{^) = / eGi{ea)d/3e^Y{ea)da 
Jo 

Notice that since M is Cda to lowest order, -e^M(y(e-))(a) is O(e^). 
Moreover, by hypothesis, we have e^Z{ea) — e'^Y{ea) = e^A(ea). Thus 

pa pa 

-eX{ea)- / e^Y{ea)da = - / {e^dpX^ea) + e^Y{ea)) da 
Jo Jo 

pa 

= - {€'^L{Z{e-)){a)+€^Y{ea))da 
Jo 

= / {e^Z{ea) - e^Y{ea)) da + 0{e^) 
Jo 

= r (e^A(ea)) da + 0{€^) 
Jo 

That is, eGi is really 0(e^)! This cancelation is the crucial step in this proof. 
Since eGi appears in each term in Bi, we have shown 

with e^G = eGi. We appeal to Lemma |H1 again, and we have 

h{a) =a-e^Giea)+0{e^). 

Now that we have particularly good estimates on h and h^^, we can begin 
our discussion of K in earnest. For notational simplicity, we will let 

h{a) = a + gi{a) 

h-\0 = C + g2{0 

If we let 

f = Q-'f 

we can make the following formal approximation using Taylor's theorem, 
K{x,y)f{a)=QoKof{a) 
=Kof{h{a)) 
=Kof{a + gi{a)) 

=Kof{a) + gi{a)Kodaf{a) + h.o.t.. 
Also by Taylor's theorem, 

/(a) = /(a) + g2{ot)daf{a) + h.o.t.. 
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Putting these together we have 

K{x, y)f{a) =Kof{a) + gi{a)Kona) + Ko{g2f'){a) + h.o.t.. 
So let 

Eif =Kof{a) + gi{a)Kof'{a) + i^o(92/')(«) 
E2f=K{x,y)f-Eif. 
We prove Proposition |31 if we can prove 

. \\Eif - Kof - Ki{x,y)f\\s < Ce^y", and 
• \\E2f\\s < Ce'y\ 
Let us deal with £^2/ first. We can rewrite £"2/ as: 
E2f = Elf + Elf + Elf 
with 

E^ =K{x, y)f - Kof - giKodJ 
Elf =Kof - Kof - Ko{g2daf) 
Elf =giKodJ - giKodaf 

As our approximation for K was determined by an application of Taylor's 
theorem, we need to prove a Lemma which shows that this formal step can 
be made rigorous, at least for functions in the weighted Sobolev spaces. 

Lemma 9. Suppose F G H'^{n), s > 1/2, n > 1/2. Then for all Co > 

there exists eo such that for e G [0, eo] there is a constant C independent of 

€ such that: 

/ \ 1/2 

/ \F{ea)\^da] <Ce^^-^l\ 

\J|a|>CoE-3 / 

Moreover, for 1 < j < s we have: 

( \ ^^^ 

/ \dlF{ea)fda\ < Ce2"-3/2+i. 

Proof. See Section |7| D 

Remark 8. If instead we are considering 

I I \F{e{a±t),e^t)\^da] 

\J\a\>Coe-3 J 

with F{-,T) € H^{n) for T € [0, Tq], we can maintain the same bound as 
above by taking Co > 2To. 

We can use the above Lemma to prove a version of Taylor's theorem. 

Lemma 10. Suppose F G H''{2), s > 5/2 and g G L°° . Then 

\\F{e-+e^g{e-))-F{e-)U.<Ce^'\ 
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Proof. By Lemma IHl we have 

\\F{e-+e'g{e-))-F{e-)f 

= / \F{ea + e'^giea)) - F(ea)pda 

J\a\<€-^ 

+ \F{ea + e^g{€a))-F{€a)\^da 

J\a\>e-3 

< f \F{ea + e^giea)) - F(€a)\^da + Ce^. 

J\a\<€-3 

Now, we add and subtract e'^g{ea)F'{ea) in the remaining integral, 
/ \F{ea + e^g{€a))-F{€a)\^da 

J\a\<e-^ 



< / \F{ea + e'^giea)) - F{ea) - e^g{ea)F' {ea)\^da 
+ / \(i^g{ea)F'{ea)\^da 

J\a\<e-^ 

< / \F{ea + e'^giea)) - F{ea) - e^g{ea)F' {ea)\^da 



+ e'U-)\\l^\\F'{-)\\l.. 

We naively bound the above integral and apply the mean value theorem. 
That is, 

/ \F{ta + e^giea)) - F{ea) - e^g{ea)F\ea)fda 

J\a\<e-3 

<Ce-^ sup \F{ea + e^g{ea)) - F{ea) - e^g{ea)F'{ea)f 

\a\<e-3 

<Ce-^ sup |eV(ea)i^"(ea*)P 

|Q|<e-3 

^^£ \\9\\l°° W-^ \\l^- 
With this, we have proven the Lemma. D 

Remark 9. With this general technique we are also able to show that 

\\di{F{e-+e'g{e-))-F{e-))h.<Ce'/'+^, 

\\di {F{e ■ +e'g{e-)) - F{o) - e^g{e-)F' {■)) ||^. < Ce'/^+\ 

\\di{F{e-+^g{e-))) 
- di (F(6.) + e'g{e-)F\.) + l/2e^g\e.)F'\e-)) h^ < Ce^^s+i 
and so on. 
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Now we will be able to control i?2- We can control the other functions in 
precisely the same fashion. Since / is long wavelength and of rapid decay, so 
is /. And thus we can use Lemma [TUl In what follows, e^F{ea) = ii'o/(a). 
(The extra e comes from the long wave approximation of Kq.) 

\\Ey{-)\\s 

= \\Kofi- + gi{-)) - Kofi-) - gi{-)KodJ{-)\U 
<Ce^Fie ■ +e^G(e-)) - Fie-) - e^Gie-)dpFie-)\\s 

Now we turn our attention to Eif—Kof—Kiix, y)f. A routine calculation 
shows that this is equal to 

- e^[Giea),Ko]e^dpFiea) - KiieX,€^Y)e^Fiea) 

+ (51 + €'G)Ko€'dpFiea) + Kq ((52 - e'G)e^dpFie-)) (a) 

Now, gi + e^G and g2 — e'^G are O(e^), so the second line above can be 
bounded by Ce^^'^, if we make use of Kqs long wavelength approximation. 
Moreover, we claim that the first line is identically zero. Let 

bia) = - e2y(eai)dai - e2M(y(e-))(a) 
Jo 

yiai)dai — My(a). 


Thus e^G(ea) = 6(a) — xia). So 

e''[Giea),Ko]e^df,Fiea) = -[xia),Ko]dafia) + [bia),Ko]d^fia). 

Taking the Fourier transform of the second term we have 

=di[bia),Ko]dafia))ik) 

Koil) - Koik)] bil - k)ilfil)dl 

I ^'%l'[f'^ ^(l - m - k)Ufil)dl 
Koil)-Koik)- 



iil - k) 



-dabil - k)ilfil)dl 
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Now notice that dab = Ly, so the above becomes 

Ko{l) - Ko{k) 



i{l-k) 
Ko{l)-Ko{k) 



Ly{l - k)ilf{l)dl 
y{l - k)ilf{l)dl 



Ko{l - k) 
{1 + Ko{k)Ko{l))y{l - k)ilf{l)dl 

=^{{y + KoyKo)daf){k). 

where we have used the trigonometric identity l\ll\i from Section^ 
That is 

e^[G{ea),Ko]e^df3F{ea) 

= - [x{a),Ko]daf{a) + {{y + KoyKo)daf) (k) 

= -Ki{x,y)f 

and so we are done with the proof of Proposition |31 



D 



Proof, for Proposition 01 

Let h be as in the above subsection and /i2 be the analogous function for 
the configuration {z2,y2) = {z + e'^/'^R^,y + e^-^/^i?^). We also define the 
function X2 = a + X2 by 8^x2 = Lz2. Unlike in the previous Lemma, here 
the time dependence of the functions is important. And so we determine x 
and X2 by integrating in both space and time. That is, 

x(a, t) =a + ex (ea, ei) 

X2{a, t) =a + ex (ea, et) + e^/'^E{t) + e^^/'^p{a, t). 

with 

ex(ea,et) = ( / n (0, s) ds + / Lz{w,t)dw 
Jo 

pa 

gii/2^= / LR'{w,t)dw 
Jo 

The functions satisfy the following estimates for all t E [0,Toe^^]. 
|ex(ea,et)| <Ce 
\e'>/^E{t)\ < Ce5/2 

The first estimate follows from similar estimates in the previous Lemma, 
the second is the naive bound and the final follows from following simple 
fact: 
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Fact 2. If p{a) = Jq r{a)da, where r € L? , then 

\p{a)\ < \/HlkllL2- 

In what follows we will make strong use of the fact that E{t) does not 
depend on a. 

Using the same techniques as were used in proving Lemma |H1 one can 
show that: 

x^H^,t) = x-\C,t) - e'/^E{t) + 611/^2(0 

where 1/92(ck)| < y^Q^II-^^IIl^- This sort of estimate carries over to the 
functions h. That is 

/i2(a, t) =h{a, t) + e>/'^E{t) + e'^/'^pi{a, t) 

where |p3(a, t)|, |p4(a, t)| < y^|a|(||i?^||j^2 + ||i?^||^2) over the long time scale. 
Now define Q2/ = / o ^2- So 

5i(z + eiV2i?^,y + eii/2)/-5i(z,y)/ 

=Q2oK^oQ-^f-QoKooQ-^f 

=Q o (Q-i oQ2oKqoQ~^oQ- Ko) o Q-i/ 

Since Q and its inverse are bounded operators from Ef^ to H^, we need only 
prove the estimate for the operator 

where Q = Q~^ o Q2. Notice that Q o Kq o Q^^ is the Hilbert operator K 
for a domain with the "/i" function given by h{a) = h2{h~^{a)). Moreover, 
from the above calculations for h and /12 we have, 

h2ih-\a)) = a + e5/2^(t) + e^^/^p5ia,t) 

with p5 satisfying the same type of estimates as p^. 

At this point we can make an appeal to Lemma 3.14 on p 1500 of j31j . 
In this lemma they prove that ||S'i(z,y)/||s < C(||2;||s + ||y||s) ll/lls- In the 
course of their proof, they show that if h{a) = a + g{a) then 

IIQ o Ko o Q-V(-) - i^o/(-)ll. < C\\dag\\s~i\\daf\\2- 

(See the inequalities in Cases I-IV on pp 1501-1506.) So if we set g = 
e'i'^E{t) + e^^''^p^{a,t)^ we see that taking a spatial derivative leaves us 
with 

da~g = 0(e"/2). 

And so, if we keep in mind that /(a, t) = e^F{e{a it t, e^t), 

IIQ o Ko o Q-V(-) - ^o/(-)IU <C\\do.~g\\s^i\\dc.fh 

This completes the proof of Proposition ID D 
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6. The error estimates 

In this section we prove that the approximation is rigorous. That is we will 
prove Theorem 121 We will be working with the three and four dimensional 
formulations of the water wave problem (equations ()WW3|) and ()WW4|) ). 
From fSTI, we know that for initial data of the type we are considering, 
solutions to these equations exist over the long times we are considering. If 
(z,y, n) is a solution to (|WW3|) . let: 

z{a,t) ='^'{a,t)+e^^^^R'{a,t) 

(28) y{a, t) =^^(a, t) + e^^/^Ry{a, t) 

u{a, t) =^"(a, t) + eii/2^"(a, t) 

with the functions ^ defined as above. We call R^, R^ and i?" "error" 
functions and we denote R = {R^ ^R^ , i?"). Our goal will be to show that R 
remain 0(1) in ^'^ = H'^ x H^ x H'^^^''^ over the long time scale. If we can 
do this, then we will have proven the main theorem. The first step will be 
to determine the equations which these functions satisfy. Loosely, we want 
to be able to write for each of the error functions an evolution equation of 
the form 

dtR = quasilinear + small and smooth. 

We will at times go to great lengths to achieve this! 
Clearly, 

dtR' = KoR"". 

Finding the equations for R^ and R^ is a bit more complex. First we focus 
on Ry. Substituting from (|28() into dty = K{z,y)u, we have 

dtRy 
= K{^' + €^^/'^R',^y + e^^^^Ry)R'' 

+ £-1^/2 fj^^^z ^ gll/2^^^ ^y ^ ^n/2-^y^^u _ g^^y 

= KoR"" + Afi(^^)9„i?" - {^y + Koi^y Ko))daR'' + Ny 

where 

Ny 

+ e^ii/2 ffsii^' + eii/2^^ ^y + e^^/^Ry) - 5l(^^ ^^)) ^" 

+ (k{-^'' + ell/2^^ -^y + e^^''^Ry) -Kq- Ki{^',^y)) i?". 

We claim that Ny is "small". That is, we have: 
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Lemma 11. For all Cr > 0, there exists eo such that for all e £ (0, eo) and 
t such that supo<(/<4 ||^(-, t')||j=|S < Cr we have: 



Proof. First we remark that the approximating functions ^, and their deriva- 
tives are all bounded over the long time scales. Thus, we will not be keeping 
track of the dependence of the norm of A^^ on the norms of these functions. 
By Proposition 121 we know that ||e~-'^^/^ReSy||s < Ce^. 
We can bound 

e-"/2 fsii^' + e^^/'^R\ -^y + e^^f^Ry) - 5i(^", ^^)) ^" 

by Lemma HJ 
Finally, 



<C (e^WRUs + e^^/'^\\R\\%s 



by the estimates on K and its expansions which we saw in Sections El and [5] 
(in particular Lemma and Lemma (HI). □ 

Now we discuss R^. We know that 

(29) dtu{l + Lz) + d^y{l + djy) = 0. 

We would like an evolution type equation for R^. Notice that since dty = 
K{z, y)u, there is a "hidden" dfU in the term d'^y. Recall that the commu- 
tator [dt, Si{z,y)]u can be bounded independently of dfU (see LemmaJHlin 
Section jSJ. Therefore, we can rewrite the above as 

(1 + Lz + da^yKiz, y))dtu + 9„y(l + [dt, Si{z, y)]u) = 0. 

Substituting in for u from (|28|) the above becomes 

= (1 + Lz + do,yK{z, y)) Sie"/^^" 
+ {l + Lz + do^yK{z,y))dteH'^ 
+ dc,y{l + [dt,Si{z,y)]^^) 

+ d^y([dt,Si{z,y)]e^y^R-\ 

or rather 

= (1 + Lz + do,yK{z, y)) Sie"/^^" 
+ (1 + Lz)5te2^" 
+ d^y{l + dt{K{z,y)^^)) 

+ d^y{[duSi{z,y)\e^^l^R^). 
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We rearrange this a bit, and break up y and z. 

= (1 + Lz + do^yKiz, y)) dtR"" + a„i?^ 

+ LWdt^^ + d^Rydt{K{z, y)^") + d^y[dt, Si{z, y)]R^ 
+ e""/2 ((1 + ^^.) Q^^2^u ^ Q^^^y (^ ^ dt{K{z, y)*"))) . 

The operator 

A{z,y) = {I + Lz + dayK{z,y)) 

is invertible since K(z,y) is a bounded operator on H'^, provided z and y 
are small (which they are). Moreover, we can approximate A^^ via the 
Neumann series. Thus the above equation can be rewritten as 

dtR'' = -(1 - e'^Wi)daRy + A^" 

where iV" = iVf + N^ and 

A^i" = - e-"/2^-i ((1 + L^') dte^^"" + 9«^^ (1 + ^^(^(z, y)^"))) 
N^ = - A-' {LR'dt^^ + a«i?^at(i^(z,y)^") + S^yi^i, 5i(z,y)]i?«) 
+ {-A'^ + (1 - e^VFi)) a„i?^. 

Lemma 12. For a// (7^? > 0, there exists eo stic/i that for all e G (0, eo) and 
t such that supg<(/<j ||^(-, t')||^s < Cr we have 

||A^"||s-i <c(e^ + e^\\R\\^s + e^/^p] 



Proof. First we point out this estimate is in H^~^ . The loss of regularity 
here is easily seen. Both LR^ and daR^ explicitly appear in iV^, and are 
not smoothed by any operators. Thus, losing this derivative is unavoidable. 
In fact, it is easy to see that the above estimates holds for N2 by noting 
that A~^, K and [^i,^!] are bounded operators. 

Bounding N^ is also easily done once we recognize that this term is almost 
exactly e~^^'^Res„. We have 

V 1 + L^^ y ^ ^ 



,£-11/2 I 5^e2^« + a^$?/i±M^ j + N^ + iv« + AT, 
=e-ii/2Res„ + iVg" + iV4" + N^ 
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where 



ATg" =£-11/2^-1 (da^^ydt {K{z,y)^'' - K(^^^^)^")) 
N^ =£-11/2(^-1 _ (1 + L^^)-i) 

X ((1 + L^^) Ste^^" + da^y (1 + at(K(*^ $^)^"))) 



AT^- 



=e 






1 + L^ 



We bound N^ using mean value theorem arguments entirely analogous to 
those used when bounding A^^. To bound Nf, one observes that 

(1 + L*^) S^e^^" + dc.'^y (1 + dt{K{^', ^y)^"")) 

is very nearly Res„ and is thus 0{e^'^''^). N^ is clearly small, as it contains 
dfReSy. This completes the proof. D 

We need to make analogous calculations for the four dimensional system. 
Let 

a{a, t) = e^m''{ea, et) + e^^''^W{a, t) 
and Re = {R^ , R^ , R^ , R"') . This extended set of error functions lives in 

It is easy to see that 

dtR"" = R" 
but more difficult to determine the evolution of R"^. We begin by taking a 
time derivative of (|29|) . 

(30) dM'^ + Lz) + dtudc^u + dadty{l + dfy) + d^ydfy = 0. 

Letting, 

I =dlu{l + Lz) + d^ydly 

II =dadty{l + d'fy) + dtudaU 

dSOl) is I + 1/ = 0. 

Manipulations very similar to those carried out in determing dtR^, show 
that 

I =A{z, y)dte^^l^R- + d^y[dl Si(z, y)]e^^l^R^ 
+ {I + Lz)dt€^-^'' + do,yd'i{K{z,y)-^''). 
For /I, we have 

// =(1 + dfy)dadte^^/^Ry + Otudo^e^^'^R'' 

+ (1 + dly)d^dt^y + dtud^^^ 
+ (1 + dly)d^dt^y + dtud^^^ 
+ (1 + 52y)aaai^^ + 9iua«M/" + e"/^^// 
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where 

Bii = dtu{i + KDdo.R'' + dhdo,{Ki{^>' , ^>y)K' + iv^) + 9„iv^. 

Noting that d^y = Kqu + [dt, Si]u + Sia, we see that that Bjj is smooth in 
the error functions, and is 0(e'^). 
Adding / and // gives 



(31) 
where 



=Aiz, y)dtR'' + (1 + d^y - dtuKo-)KodaR'' 



B =Bn + day[dl Si{z, y)]R^ + Bj^^ 
e^i/^SRes =(1 + Lz)dte'^^^ + dayd^iK{z, y)^") 
+ (1 + d^y)dadt^y + 9in5„^". 

The terms Bu and day[d^, Si{z, y)] are small and smooth, and we can bound 
-Bptes via the residual estimates, much as we did for N^^ above. That is, we 
have 



B\\s-i <c(e^ + e^WRUs + e^^/^||i?| 



2 



under the same hypotheses as in the above Lemmas. 

At this time, it is tempting to simply invert A(z, y). Though we could do 
this, the inverse of this operator is not smoothing. In particular the presence 
of the term dayK^ in A will cause problems. We can eliminate Kq to highest 
order by letting Ho{z,y) = {1 + Lz — dayKo) act on (|ST|) . We have for the 
first term 

Ho{z,y)A{z,y)dtR'' 
=(1 + LzfatR"" + (1 + Lz)d^yKodtR'' - d^yK^ ((1 + Lz)dtR'') 

- d^yKo (dc^yKodtR'') + Ho{z, y)(9,y5i(z, y)dtR'') 
= ((1 + Lzf + {d^yf + Hi{z, y)-) dtR"" 

where 

Hi{z,yy =day {[Lz,Ko] ■ -Ko[day,Ko] ■ -{l + K^)day) 
+ Ho{z,y) {daySi{z,y)-) . 

Notice that Hi is made up of smoothing operators, and is thus a smoothing 
operator. 
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Now, for the second term in (j3^ we have 

Ho{z, y){l + dfy - dtuKo-)Kodo^R^ 
=(1 + Lz){l + d^y)KodaR'' - (1 + Lz)dtuKldo,R'' 

- dayKo ((1 + d^y)KodaR'') + d^yKo {dtuK^d^R'') 
={l + Lz){l + d^y)KodaR'' 

- day[Ko,d^y]KodaR'' + dayKo {dtuKld^R'') 

+ {dtu{l + Lz) + 9«y(l + d^)) KidaR"". 

Notice that by comparing the last hne of the above with (|29|). we see that 
it is identicahy zero! One more rearrangement of this yields 

((1 + Lz){l + d^y) - daydtu) KodaR"" + ^2 

where 

B2 = day {-[Ko, d^y] + [Ko, dtu]Ko + 9*^(1 + Ki)) Kod^u 

is a smooth and small function by Lemma El in Sectional 
If we let 

g={l + Lz){l + d''ty)-daydtu 
then we have transformed (|I-{1|) into 

=(1 + fHi{z, y))dtR^ + fgKodaR'' 
+ fH^{z,y){da{K^{^\-^y)R-)) 
+ fiB2 + Ho{z,y)B). 
By the Neumann series, 

oo 

(1 + fH^{z, y)-)-' = 1 + Y^i-irrH^z, y) ■ . 

n=0 

Since Hi is smoothing, this is the identity plus a smoothing piece. Let 

oo 
n=0 

Thus, 

=5iii" + fgKodaR'' 

+ fHoiz, y) (da (i^l(^^ M/^)i?")) - iVf - iVf 

where 

-iVf =H2{z,y) ifgKodaR'' + fHo{z,y) (5, (i^l(*^ *^)i?"))) 

-N^ =(1 + i72(^, y))f {B2 + Ho{z, y)B) . 

Finally we rewrite the above as 

dtR" = - (I + Koa - Lz + N^) daKoR"" - d^ (i^l(^^ ^^)i?") + A^"" 
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with 

K =f9 - (1 + Koa - Lz) 

AT" =N^ + N^ + N^ 

Nt = - ifHo{z,y) - 1) (d^ (Kl(^]/^^^')i^-)) . 

Notice that {I + Kqu — Lz) is the first order approximation to fg. And thus, 
using techniques exactly hke those we used in proving the bounds on N^ 
and N^, we have the following. 

Lemma 13. For all Cr > 0, there exists eo such that for all e G (0, eo) and 
t such that supo<(/<4 ||^e('ii')llfl? ^ ^R ^^ have 



^^"ll II AT"-\\ \ <r^ r< l ^3 _, 2\\ r> n _i ll/2\\ r) ||2 



max- 

Recapping, we have shown that the three dimensional system may be 
rewritten as 

dtR^ = K^R" 

(32) dtRy = KoR'' + Mi(^^)9„i2" - (*^ + i^o(e'*^i^o))9ai?" + A^^ 

5ti?" = -(1 - e^Wi)daRy + iV" 
and the four dimensional system as 
dtR^ =KqR'' 
dtRy =KqR'' + Mi(^^)a„i?" 

- (^^ + KQ{eHyKo))d^R^ + Ny 
^■^"^^ dtR'' =R- 

dtR'' = -[i + Koa-Lz + N^) daKoR"" 
-da{Ki{^',^y)R'')+N''. 

We remark now that these are only cosmetically different than the equa- 
tions which determine the evolution of the error for the KdV approximation 
alone in [2j. See p 1524 for the equations in three dimensions and p 1526 
in four. Their variables 

(Zi,X2,[/i,yi) 

correspond to our 
and their functions 



{z,y,u,a), 



are our 



{N^,Ny,N'',N^,N''). 

The only difference of note is that their estimates contain a term they call 
q{t) while ours do not. This term, which is related to the interaction of the 
left and right moving wavetrains, has been removed in this paper by the 
inclusion of the function W3 in the approximating functions ^. This sim- 
plification does not adversely affect the means which they employ to prove 
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that the error functions remain 0(1) over the long time scale. Therefore we 
appeal to their results on pp 1524-1533. That is, 

Proposition 5. For all Tq > 0, s > 4 and Cj > 0, there exists eo such that 
for alio < e < cq, the unique solution R^ of iclcl\) with initial conditions such 
that 

\\Re{;0)Us<Ci 
satisfies 

sup \\Re{;t)Us<C 
te[0,Toe-3] 

where C is independent of e. 

Implicit in the above Proposition is the assumption that the initial con- 
ditions for the water wave problem have the form: 

z(q,0) \ / M/^(a,0) \ 

yia,0) = ■^y{a,0) +e'^I^Ro{a). 

u(a,0) / \ M/«KO) / 

So we see that this Proposition immediately proves Theorem |21 

Now that we have this result, there are a few small steps, and one big 
step, needed to prove Theorem El The first simple step is to note that the 
z is not a very physical coordinate and that we would prefer estimates for 
Xa. Since L is a bounded operator and gives the relationship between both 
z and Xa and ^^ and ^ , we have automatically 

sup \\xa{-,t)-^>\-,t)\\H^ < Cell/2. 

te[0,Toe-3] 

Secondly, the expressions for ^'^, ^^ and ^" contain terms of O(e^). 
These terms were needed to make the residual sufficiently small, but they 
are unneccessary now. Moreover, the appearance of the operator L~^ and 
inverse derivatives in the definitions of ^^ and ^" is not very intuitive. And 
so, it is a simple consequence of Lemmas and the triangle inequality that 

ll^'^-V^'^lls <Ceii/2 

||*''-^''||s <Ceii/2 

where ip , tp^ and V" were given in the Introduction in equations Q), © 
and ©. And so we have the corollary 

Corollary 1. // the initial conditions for iWW}) are of the form 

Xa{a,0)\ / iJ^{a,0) 
(34) I y{a,0) = V^^a^O) \+e'^/^Riia) 

u{a,0) J \ ^^"(0,0) 
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with \\Ri\\sjs < Cj then the solution of liWW\) satisfies the estimate: 

n(.,i) / \ r(-,t) / ^s 

for t G [0,Toe~^]. The constant Cp does not depend on e. 

Finally, we must deal with initial conditions. Recall from the discussion 
in Section [21 that it is typical to specify the initial data for the water wave 
problem in the long wavelength, small amplitude limit by 

(35) ixa{a, 0),y{a, 0),u{a, 0)) = (0, e^0y(ea), e^6„(ea)). 

However, the above results are applicable if the initial data is of the form 
seen in ()c{4() . We eliminate this discrepency by altering the initial parame- 
terization of the free surface. What should this change be? Clearly, 

(36) a = a + x{a,0). 

Now set U{P,0) = UoW), V{f3,0) = Vo{(3), F{p,0) = Fo{p), G{p,0) = 
GoiP), and P(/3,0) = 0, and let 



a 



where 



(37) 



eXi{a) 
€^X2{a) 



=a+ I -0(0,0)^0 
/o 

a + eXi{ea) + e^X2{ea) 

(Uoiea) + Voiea)) da 
(Fo(ea) + Go(ea)) da. 



With this definition, we clearly have satisfied the first condition in (|34)1 . We 
also want 

ej,(ea) = eV(a) + 0(e^^/2) 

G„(ea) = e2^"(a)+0(eii/2) 

or rather 

e^Qyiea + e'^Xi{ea) + e^X2(ea)) = eV^(a) + 0(e"/^) 
e^euiea + e^Xi^ea) + e^X2{ea)) = e2^"(a) + 0{e^^^^). 
Applying Taylor's theorem we have 

Qy + e^XiQ'y ={Uo + Vo) + e^i^dj^Uo + ^dj^Vo) 

+ e\Fo + Go) + e\Uo + Vo)^ 

Ou + e^Xie; ={Uo - Vo) + e\ld}_Uo - IdlVo) 



+ e\Fo-Go) + e\^U^ 
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We can 


solve the above 


by taking 




(38) 






Uo = l/2{ey + G„) 








Vo = 1/2(9^ - e„) 




and 










(39) 






Fo = l/2{hy + K) 








Go = l/2{hy - K) 




where 












hy 


= x,e'y- 


-\dl_Uo-\dl^Vo- 


{Uo + Vo)^ 




hu 


= xie; 


-\dl_Uo + \dl^Vo- 


^TT'i -u "^1/2 



The functions Uq, Vq, Fq and Go are all in H'^{A), and so the use of 
Taylor's theorem is justified by Lemma [TUl So we have proven: 

Lemma 14. Given initial conditions for the water wave equation in the 
form ^, define Uq, Vq, Fq, Gq, Xi and X2 as in ^, ^ and CT - 
Then the reparameterization of the initial profile given hy: 

a = a + eXi(ea) + eX2{ea) 

results in initial conditions given hy \'J4\} - 

Remark 10. Let v? (/3±,0) = ip^ (/3±). Then this Lemma will still he true 
if we replace Fq with ipQ and Gq with ip~^ and set Fq and Gq to be identically 
zero. That is, we have some choice in the way we select the initial conditions 
for the higher order equations. 

Combining this Lemma with Corollary ^ we prove Theorem [3 So we are 
done. 

7. Assorted proofs 

Proof For Lemma El Let f^^iO = S, — 52(0- Since f~^{f{a)) = a we 
have 

a = /(") -92{f{a)), 
or rather 

(40) 52(/(«)) = 5(a) 

Notice that this relation implies ||52||l°° = HffllLo^- Taking a derivative, we 
have 

52(/(a)) 



l + g'{a) 

which implies that ||52lli°° ^ C'll9'll-L°=- If we expand the left hand side of 
H40|) by the mean value theorem we see 

g2{a + g{a)) = g{a) 

52(a) + g'2{a*)g{a) = g{a). 
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This implies g2{a) = g{a) + 0{\\g'\\Loc\\g\\ioa). Now, (|^n]) can be rewritten 
and expanded using Taylor's theorem: 

92{i) = g{r\i)) 

= 9{i-9{i) + 0{\\g'\\L^\\g\\L^)) 

= 9(0 + 9'{0i-9i0 + Odb'IlL- II^IU-)) 
+ l/2g"{C){-9{0+0{\\g'\\Lo^\\g\\L^)f 
= 9(0- 9(09' iO-E 

which completes the proof. D 

Proof. For LemmalHl Since {1 + P^)"'/^F{P) G iJ^ by the Sobolev embed- 
ding Theorem there is a C such that 



So 



F{P) <C7(l + /32)-"/2, 

/ \F{ea)\^da 

<C [ jl + (ea)2r"da 



\a\>Coe-3 

The higher derivatives are bounded in exactly the same fashion. The extra 
powers of e come from the long wavelength scaling. D 

Proof. For Lemma ^ The proof is a straightforward calculation. 

P/(.)-A/(-)llf 

{i + ey\{A{k)-Mk))f{k)\^dk 

< c l{i + ky\k''j{k)\^dk 

= C l\l + k^y\dif{k)\^dk 

= c\\d:^fi-)\\l 

The proof for long wavelength data follows immediately from this. D 
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Proof. For Lemma [21 The fact that g{ex) is bounded as such in L^ fol- 
lows automatically from the geometric series approximation. That is, since 
\e'^f{ex)\ < 1/2, we know that: 

\giex)\<C\e^f{ex)\\ 

And thus we have: 

Ue-)\\h 
<C fe^\f{ex)\Ux 

<Ce^\\f{-)\\l^ j f\ex)dx 
<Ce"^f{-)\\s. 
Now consider the LP' norm of g'{ex). A direct calculation shows that 

±g{ex) = -e'f'iex) ( (^-^^l^(^^ " 1 

Taylor's theorem shows that 

\±g{ex)\<C\e^nex)\\e^f{ex)\. 

And so, just as before we have that this is bounded by Ce^'^ (which is of 
course bounded by Ce^'^). 

We could keep on going in this fashion — showing each derivative of g is 
bounded. This is however difficult as finding higher and higher derivatives 
is a notationally taxing job — see the expression of Faa-di-Bruno for proof of 
that! 

Instead we take the following approach. Let 

h{y) = -^-l + y. 
1 + 2/ 

For y € [—1/2, 1/2], this function is real analytic and there exists another 
function h{y) (real and analytic on the same interval) such that h{y) = 
yh{y). Now, define he{Y) = h{e'^y). We have that: 

II^e(-)llc=[-l/2e-2,l/2e-2] < \\h{-)\\c^[-l/2,l/2]- 

The point here is that the C^ norm of ft-e can be bounded independently of 
e. 

Now notice that g{X) = e^ f {X)h,{f {X)) . Since / G H'+^ , we know 
that / G C^. This implies that h^{f{X)) £ C\ with C norm bounded 
independent of e. Thus we have f {X)h^{f {X)) G H^ , with a norm bounded 
independent of e. With this in hand, we have that ||(7(-)||s < Ce^, with 
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C 7^ C(e). Now, the derivatives of g can be bounded as follows: 

=e"lb(")(e-)llL2 
<Ce"-V2||5(.)||^ 

Provided n > 2, this term is small enough. And so we have shown that 
the first s derivatives are sufficiently small in L^, and we have proved the 
Lemma. D 
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